Sequential anomaly detection based on temporal-difference learning: Principles, models and case studies
نویسنده
چکیده
Anomaly detection is an important problem that has been popularly researched within diverse research areas and application domains. One of the open problems in anomaly detection is the modeling and prediction of complex sequential data, which consist of a series of temporally related behavior patterns. In this paper, a novel sequential anomaly detection method based on temporal-difference (TD) learning is proposed, where the anomaly detection problem of multi-stage cyber attacks is considered as an application case. A Markov reward process model is presented for the anomaly detection and alarming process of sequential data and it is verified that when the reward function is properly defined, the anomaly probabilities of sequential behaviors are equivalent to the value functions of theMarkov reward process. Therefore, TD learning algorithms in the reinforcement learning literature can be used to efficiently construct anomaly detectionmodels of complex sequential behaviors by estimating the value functions of the Markov reward process. Compared with other machine learning methods for anomaly detection, the proposed approach has the advantage of simplified labeling process using delayed evaluative signals and the prediction accuracy can be improved even if labeled training data are limited. Based on the experimental results on intrusion detection of host computers using system call data, it was shown that the proposed anomaly detectionmethod can achieve higher or at least comparable detection accuracies than other approaches including SVMs, and HMMs. 2009 Elsevier B.V. All rights reserved.
منابع مشابه
Control of Multivariable Systems Based on Emotional Temporal Difference Learning Controller
One of the most important issues that we face in controlling delayed systems and non-minimum phase systems is to fulfill objective orientations simultaneously and in the best way possible. In this paper proposing a new method, an objective orientation is presented for controlling multi-objective systems. The principles of this method is based an emotional temporal difference learning, and has a...
متن کاملMachine Learning Techniques for the Domain of Anomaly Detection for Computer Security
In this proposal, we examine the machine learning issues raised by the domain of anomaly detection for computer security. The anomaly detection task is to recognize the presence of an unusual (and potentially hazardous) state within the behaviors or activities of a computer user, system, or network with respect to some model of `normal' behavior which may be either hard-coded or learned from ob...
متن کاملA Comparative Study of Learning and Motivation in Continuing Medical Education Based on Integrated Instructional and Motivational Design Models
Introduction: There are few studies that compare electronic learning in continuing medical education using instructional material developed based on scientific principles of instructional and motivational designs. Therefore, this study was performed in Kermanshah University of Medical Science in 2011 in order to compare physicians’ learning and motivation in these two instructional approaches. ...
متن کاملAnomaly detection in banking operations
This paper presents an overview of anomaly detection algorithms and methodology, focusing on the context of banking operations applications. The main principles of anomaly detection are first presented, followed by listing some of the areas in banking that can benefit from anomaly detection. We then discuss traditional nearest-neighbor and clustering-based approaches. Time series and other sequ...
متن کاملApplying Forward Models to Sequence Learning: a Connectionist Implementation
The ability to process events in their temporal and sequential context is a fundamental skill made mandatory by constant interaction with a dynamic environment. Sequence learning studies have demonstrated that subjects exhibit detailed — and often implicit — sensitivity to the sequential structure of streams of stimuli. Current connectionist models of performance in the so-called Serial Reactio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Appl. Soft Comput.
دوره 10 شماره
صفحات -
تاریخ انتشار 2010